Where Do Rewards Come From?

نویسندگان

Satinder Singh

Ann Arbor

Richard L. Lewis

Andrew G. Barto

چکیده

Reinforcement learning has achieved broad and successful application in cognitive science in part because of its general formulation of the adaptive control problem as the maximization of a scalar reward function. The computational reinforcement learning framework is motivated by correspondences to animal reward processes, but it leaves the source and nature of the rewards unspecified. This paper advances a general computational framework for reward that places it in an evolutionary context, formulating a notion of an optimal reward function given a fitness function and some distribution of environments. Novel results from computational experiments show how traditional notions of extrinsically and intrinsically motivated behaviors may emerge from such optimal reward functions. In the experiments these rewards are discovered through automated search rather than crafted by hand. The precise form of the optimal reward functions need not bear a direct relationship to the fitness function, but may nonetheless confer significant advantages over rewards based only on fitness.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Author-assigned Keywords in Research Articles: Where Do They Come from

متن کامل

Self-regulation of a queue: A tutorial

It is well-known that in a queueing system, customers, who mind only their selfish interests, join a queue at a rate which is higher than it is socially desired. The reason behind that is that when customers assess the costs and rewards associated with joining, they mind only their own, while ignoring the additional costs, known as externalities, they inflict on others due to their joining. A c...

متن کامل

Whither Mental Health Policy-Where Does It Come from and Does It Go Anywhere Useful?; Comment on “Cross-National Diffusion of Mental Health Policy”

Factors influencing cross-national diffusion of mental health policy are important to understand but complex to research. This commentary discusses Shen’s research study on cross-national diffusion of mental health policy; examines the extent to which the three questions researched by Shen (whether countries are more likely to have a mental health policy (a) the earlier a country becomes a memb...

متن کامل

Welcome to virosphere

Viruses may seem alien, but they are the most abundant and, arguably, the most important organisms on Earth. They are found just about everywhere, from oceans and forests to the people around you and, of course, in and on you as well. This world of strange, quasi-living things has been dubbed the virosphere, and it is a mysterious one – we know less about viruses than any other life form. But t...

متن کامل

Bridging the gap between extrinsic and intrinsic motivation in the cognitive remediation of schizophrenia.

An important development in cognitive remediation of schizophrenia is a focus on motivation. However, following a distinction between the concepts of intrinsic motivation (IM) and extrinsic motivation, discussions of IM-based methods have downplayed or misrepresented the role that extrinsic rewards can, and actually do, serve to promote positive treatment outcomes in cognitive remediation. Ther...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2009

Where Do Rewards Come From?

نویسندگان

چکیده

منابع مشابه

Author-assigned Keywords in Research Articles: Where Do They Come from

Self-regulation of a queue: A tutorial

Whither Mental Health Policy-Where Does It Come from and Does It Go Anywhere Useful?; Comment on “Cross-National Diffusion of Mental Health Policy”

Welcome to virosphere

Bridging the gap between extrinsic and intrinsic motivation in the cognitive remediation of schizophrenia.

عنوان ژورنال:

اشتراک گذاری